<p>Vowpal Wabbit handles learning problems with any number of sparse features. It is the first published tera-scale learner<sup><a class="cite_sup" href="#DBLP:journals/corr/abs-1110-4198">[1]</a></sup>. It supports distributed, out-of-core learning and pioneered the feature hashing techniques<sup><a class="cite_sup" href="#Shi:2009:HKS:1577069.1755873">[2]</a> <a class="cite_sup" href="#DBLP:journals/corr/abs-0902-2206">[3]</a></sup>. Out-of-core learning streams examples instead of loading them all into memory, and hashing maps every feature into a fixed-size weight table, so together they keep the memory footprint bounded regardless of training data size.</p>
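
<p>To illustrate why feature hashing bounds memory, here is a minimal Python sketch, not Vowpal Wabbit's actual implementation. The names <code>NUM_BITS</code>, <code>slot</code>, <code>hash_features</code>, <code>predict</code> and <code>sgd_update</code> are hypothetical, the table size is an arbitrary choice, and CRC32 is only a stand-in hash function picked because it ships with the standard library.</p>

```python
import zlib

NUM_BITS = 18                  # assumption: an arbitrary table size for this sketch
NUM_SLOTS = 2 ** NUM_BITS      # the weight table never grows beyond this


def slot(feature_name):
    """Map a feature name to an index in a fixed range (stand-in hash: CRC32)."""
    return zlib.crc32(feature_name.encode("utf-8")) % NUM_SLOTS


def hash_features(features):
    """Convert a sparse {name: value} dict into a sparse {index: value} dict.

    Features that collide are summed into the same slot.
    """
    hashed = {}
    for name, value in features.items():
        index = slot(name)
        hashed[index] = hashed.get(index, 0.0) + value
    return hashed


def predict(weights, features):
    """Linear prediction over the hashed representation."""
    return sum(weights[i] * v for i, v in hash_features(features).items())


def sgd_update(weights, features, label, learning_rate=0.1):
    """One squared-loss gradient step that touches only the active slots."""
    error = predict(weights, features) - label
    for i, v in hash_features(features).items():
        weights[i] -= learning_rate * error * v


# Examples are consumed one at a time, so only the weight table stays in memory.
weights = [0.0] * NUM_SLOTS
stream = [
    ({"word=cheap": 1.0, "word=pills": 1.0}, 1.0),
    ({"word=meeting": 1.0, "word=agenda": 1.0}, 0.0),
]
for _ in range(50):
    for features, label in stream:
        sgd_update(weights, features, label)
```

<p>However many distinct feature names appear in the stream, <code>weights</code> keeps exactly <code>NUM_SLOTS</code> entries. The price is that unrelated features can occasionally share a slot.</p>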

<div class="hidden">
  <ol class="bibliography"><li><div class="bib_content" id="DBLP:journals/corr/abs-1110-4198">
  Alekh Agarwal, Olivier Chapelle, Miroslav Dudík, and John Langford,
  <a href="https://arxiv.org/abs/1110.4198">A Reliable Effective Terascale Linear Learning System</a>
  (2011)
</div>
<a class="details" href="/bibliography/DBLP_journals/corr/abs-1110-4198.html">Get .bib</a></li>
<li><div class="bib_content" id="Shi:2009:HKS:1577069.1755873">
  Qinfeng Shi, James Petterson, Gideon Dror, John Langford, Alex Smola, and S.V.N. Vishwanathan,
  <a href="https://dl.acm.org/citation.cfm?id=1577069.1755873">Hash Kernels for Structured Data</a>
  (2009)
</div>
<a class="details" href="/bibliography/Shi_2009_HKS_1577069.1755873.html">Get .bib</a></li>
<li><div class="bib_content" id="DBLP:journals/corr/abs-0902-2206">
  Kilian Q. Weinberger, Anirban Dasgupta, Josh Attenberg, John Langford, and Alexander J. Smola,
  <a href="https://arxiv.org/abs/0902.2206">Feature Hashing for Large Scale Multitask Learning</a>
  (2009)
</div>
<a class="details" href="/bibliography/DBLP_journals/corr/abs-0902-2206.html">Get .bib</a></li></ol>
</div>
